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CLAIMS 

1. A Method of processing a natural language 
nput, comprising: 

identifying N-best part-of-speech (POS) sequences 

corresponding to the natural language input; 
identifying a likely base noun phrase (baseNP) 
sequence based on the N-best POS sequences 
identif\ed; and 
outputting tlie likely baseNP sequence. 



2. The method\ of claim 1 wherein identifying a 
likely baseNP sequence comprises: 

identifying a plurality of baseNP sequences for 
each of the Nfr-best POS tag sequences; and 

calculating which\ of the plurality of baseNP 
sequences is mast likely. 

3. The method of claim 2 wherein calculating 
which of the plurality o^ baseNP sequences is most 
likely comprises: 

calculating a likely baseNP sequence that is most 
likely based on lexical information indicative 
of a position of words in the natural language 
input relative to bas^NPs identified in the 
baseNP sequences. 



4 . ' The method of claim 3 Wherein calculating a 
likely baseNP sequence that is rr^ost likely '-based on 
lexical information comprises: 
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calculating a likely baseNP sequence that is most 
likely based on lexical information indicative 
of POS flags assigned to the words in the 
natural language input. 



5. The method\ of claim 3 wherein calculating a 
likely baseNP sequence comprises: 

• calculating a likely baseNP sequence based on the 
lexical information for every word in the 
natural language input. 

6. The method of claim 2 wherein the natural 
language input comprises a sentence and wherein 
calculating which of ttie plurality of baseNP sequences 
is most likely comprises: 

calculating which! of the plurality of baseNP 
sequences is \ most likely over the entire 
sentence . 

7. The method of blaim 3 wherein a baseNP rule 
comprises a sequence of ffOS tags corresponding to words 
in the natural language 1 input identified as a baseNP 
and wherein calculating\ a likely baseNP sequence 
comprises : 

calculating a probability of POS tags and baseNP 
rules, given their context. 



8. The method of claim 7 wherein calculating a 
probability of POS tags and IbaseNP rules comprises: 
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calculating the probability of POS tags and baseNP 



rules given n 



/ 



prior POS tags or baseNP rules 



9. The method of claim 1 wherein identifying a 
likely baseNP sequence includes: 

calculating a probability of each of the N-best 
POS sequences \ given the natural language 
input . 

10. A natural language processing system for 
processing a natural language input, comprising: 

a base noun phra.se (baseNP) identifier configured 
to receive N-best part-of -speech (POS) tag 
sequences for the \ natural language input and 
identify a likely \baseNP sequence of baseNPs 
corresponding to tne natural language input, 
given the N-best POS\ tag sequences. 

11. The system of claim 10 and further comprising: 
a POS tagger, coupled to\ the baseNP identifier, 

receiving the natural language input and 
calculating the N-best POS tag sequences 
corresponding to the natural language input. 

12. The system of claim lu. wherein the baseNP 
identifier is configured to identify a plurality of 
baseNP sequences for each of the POS tag sequences and 
calculate which of the plurality o\ baseNP sequences is 
the likely baseNP sequence 
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13. The system of claim 12 wherein the baseNP 
identifier further comprises: 

a unified statistical model that includes lexical 
information indicative of a position of words 
in the natural language input relative to 
baseNPs identified in the baseNP sequences. 



14. The system of claim 13 whierein a baseNP rule comprises 
a sequence of POS tags corresponding to words in the 
natural language input that are identified as a baseNP and 
wherein the unified statistical model includes a baseNP 
rule component for calculating a probability of POS tags 
and baseNP rules, given contextual information. 



15. The system of claim 
component is configured to ca 
tags and baseNP rules, given 
rules . 



4 wherein the baseNP rule 
Lculate the probability of POS 
n prior POS tags and baseNP 



16. The system of c 
language input comprises 
unified statistical model 
which of the plurality o 
likely over the entire sentience 



17. A method of processing a 
identifying N-best p 

corresponding to t$e 



aim 15 wherein the natural 
a sentence and wherein the 
s configured for calculating 
f baseNP sequences is most 



linguistic input, comprising: 
rt-of -speech (POS) sequences 
linguistic input; 
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identifying one or more base noun phrases (baseNPs) 
for each of the N-berat POS sequences to form a 
plurality of different possible baseNP sequences 



corresponding to ea 
each baseNP sequence 



likely baseNP sequer ce based on a probability of 



the associated POS 
the baseNP sequenc 
indicative of a 
linguistic input 
identified in the bat 



h of the POS sequences ; for 
, identifying whether it is a 



equence and a probability of 
5, given lexical information 
position of words in the 
relative to the baseNPs 
seNP sequence; and 



outputting the likely baseNP sequence identified, 



18. The method of claim 17 wherein identifying one or more 
baseNPs for each of the N-besp POS sequences comprises: 

identifying baseNP rules for each of the N-best POS 



sequences, 



the baseNP rules 



19. The method of claim 
each baseNP sequence is 
comprises: 

calculating a probab 
rules, given n 



comprising 



sequence of POS tlags corresponding to words in 
the linguistic input identified as a baseNP. 



in the POS sequence. 



20. The method of claim 
comprises a sentence and fw 
baseNP sequence is a likely 



48 wherein identifying whether 
a likely baseNP sequence, 

:_lity of POS tags and baseNP 
rior POS tags or baseNP rules 



9 wherein the linguistic input 
herein identifying whether each 
baseNP sequence comprises: 
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identifying whether/ each baseNP sequence is a 
likely baseN^ sequence over the entire 
sentence . 
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